Search CORE

37 research outputs found

Influent generator : towards realistic modelling of wastewater flowrate and water quality using machine-learning methods

Author: Li Feiyi
Publication venue
Publication date
Field of study

Depuis que l'assainissement des eaux usées est reconnu comme un des objectifs de développement durable des Nations Unies, le traitement et la gestion des eaux usées sont devenus plus importants que jamais. La modélisation et la digitalisation des stations de récupération des ressources de l'eau (StaRRE) jouent un rôle important depuis des décennies, cependant, le manque de données disponibles sur les affluents entrave le développement de la modélisation de StaRRE. Cette thèse vis e à faire progresser la modélisation des systèmes d'assainissement en général, et en particulier en ce qui concerne la génération dynamique des affluents. Dans cette étude, différents générateurs d'affluent (GA), qui peuvent fournir un profil d'affluent dynamique, ont été proposés, optimisés et discutés. Les GA développés ne se concentrent pas seulement sur le débit, les solides en suspension et la matière organique, mais également sur les substances nutritives telles que l'azote et le phosphore. En outre, cette étude vise à adapter les GA à différentes applications en fonction des différentes exigences de modélisation. Afin d'évaluer les performances des GA d'un point de vue général, une série de critères d'évaluation de la qualité du modèle est décrite. Premièrement, pour comprendre la dynamique des affluents, une procédure de caractérisation des affluents a été développée et testée pour une étude de cas à l'échelle pilote. Ensuite, pour générer différentes séries temporelles d'affluent, un premier GA a été développé. La méthodologie de modélisation est basée sur l'apprentissage automatique en raison de ses calculs rapides, de sa précision et de sa capacité à traiter les mégadonnées. De plus, diverses versions de ce GA ont été appliquées pour différents cas optimisées en fonction des disponibilités d'études et ont été des données (la fréquence et l'horizon temporel), des objectifs et des exigences de précision. Les résultats démontrent que : i) le modèle GA proposé peut être utilisé pour générer d'affluents dynamiques réalistes pour différents objectifs, et les séries temporelles résultantes incluent à la fois le débit et la concentration de polluants avec une bonne précision et distribution statistique; ii) les GA sont flexibles, ce qui permet de les améliorer selon différents objectifs d'optimisation; iii) les GA ont été développés en considérant l'équilibre entre les efforts de modélisation, la collecte de données requise et les performances du modèle. Basé sur les perspectives de modélisation des StaRRE, l'analyse des procédés et la modélisation prévisionnelle, les modèles de GA dynamiques peuvent fournir aux concepteurs et aux modélisateurs un profil d'affluent complet et réaliste, ce qui permet de surmonter les obstacles liés au manque de données d'affluent. Par conséquent, cette étude a démontré l'utilité des GA et a fait avancer la modélisation des StaRRE en focalisant sur l'application de méthodologies d'exploration de données et d'apprentissage automatique. Les GA peuvent donc être utilisés comme outil puissant pour la modélisation des StaRRE, avec des applications pour l'amélioration de la configuration de traitement, la conception de procédés, ainsi que la gestion et la prise de décision stratégique. Les GA peuvent ainsi contribuer au développement de jumeaux numériques pour les StaRRE, soit des système intelligent et automatisé de décision et de contrôle.Since wastewater sanitation is acknowledged as one of the sustainable development goals of the United Nations, wastewater treatment and management have been more important then ever. Water Resource Recovery Facility (WRRF) modelling and digitalization have been playing an important role since decades, however, the lack of available influent data still hampers WRRF model development. This dissertation aims at advancing the field of wastewater systems modelling in general, and in particular with respect to the dynamic influent generation. In this study, different WRRF influent generators (IG), that can provide a dynamic influent flow and pollutant concentration profile, have been proposed, optimized and discussed. The developed IGs are not only focusing on flowrate, suspended solids, and organic matter, but also on nutrients such as nitrogen and phosphorus. The study further aimed at adapting the IGs to different case studies, so that future users feel comfortable to apply different IG versions according to different modelling requirements. In order to evaluate the IG performance from a general perspective, a series of criteria for evaluating the model quality were evaluated. Firstly, to understand the influent dynamics, a procedure of influent characterization has been developed and experimented at pilot scale. Then, to generate different realizations of the influent time series, the first IG was developed and a data-driven modelling approach chosen, because of its fast calculations, its precision and its capacity of handling big data. Furthermore, different realizations of IGs were applied to different case studies and were optimized for different data availabilities (frequency and time horizon), objectives, and modelling precision requirements. The overall results indicate that: i) the proposed IG model can be used to generate realistic dynamic influent time series for different case studies, including both flowrate and pollutant concentrations with good precision and statistical distribution; ii) the proposed IG is flexible and can be improved for different optimization objectives; iii) the IG model has been developed by considering the balance between modelling efforts, data collection requirements and model performance. Based on future perspectives of WRRF process modelling, process analysis, and forecasting, the dynamic IG model can provide designers and modellers with a complete and realistic influent profile and this overcomes the often-occurring barrier of shortage of influent data for modelling. Therefore, this study demonstrated the IGs' usefulness for advanced WRRF modelling focusing on the application of data mining and machine learning methodologies. It is expected to be widely used as a powerful tool for WRRF modelling, improving treatment configurations and process designs, management and strategic decision-making, such as when transforming a conventional WRRF to a digital twin that can be used as an intelligent and automated system

CorpusUL

A jet tagging algorithm of graph network with HaarPooling message passing

Author: Li Wei
Liu Feiyi
Ma Fei
Publication venue
Publication date: 14/08/2023
Field of study

Recently methods of graph neural networks (GNNs) have been applied to solving the problems in high energy physics (HEP) and have shown its great potential for quark-gluon tagging with graph representation of jet events. In this paper, we introduce an approach of GNNs combined with a HaarPooling operation to analyze the events, called HaarPooling Message Passing neural network (HMPNet). In HMPNet, HaarPooling not only extracts the features of graph, but embeds additional information obtained by clustering of k-means of different particle features. We construct Haarpooling from five different features: absolute energy

\log E

, transverse momentum

\log p_T

, relative coordinates

(\Delta\eta,\Delta\phi)

, the mixed ones

(\log E, \log p_T)

and

(\log E, \log p_T, \Delta\eta,\Delta\phi)

. The results show that an appropriate selection of information for HaarPooling enhances the accuracy of quark-gluon tagging, as adding extra information of

\log P_T

to the HMPNet outperforms all the others, whereas adding relative coordinates information

(\Delta\eta,\Delta\phi)

is not very effective. This implies that by adding effective particle features from HaarPooling can achieve much better results than solely pure message passing neutral network (MPNN) can do, which demonstrates significant improvement of feature extraction via the pooling process. Finally we compare the HMPNet study, ordering by

p_T

, with other studies and prove that the HMPNet is also a good choice of GNN algorithms for jet tagging

arXiv.org e-Print Archive

Increased transgene expression mediated by recombinant adeno-associated virus in human neuroglia cells under microgravity conditions

Author: Deng Yulin
Li Yali
Ma Chengwei
Ma Hong
Sun Feiyi
Zhang Jiewen
Zhang Lan
Zhang Yaxi
Publication venue: Lorem Ipsum Press
Publication date: 01/08/2016
Field of study

The space environment has the special characteristics of radiation, noise particularity and weightlessness, all of which have adverse effects on astronauts’ muscles, bones, neurons and immune system. Some reports have shown that chemotherapy and radiotherapy can increase the activity of the recombinant adeno-associated virus (AAV) which is widely used in gene therapy. In this paper, recombinant AAV2 (rAAV2) was first packaged with the enhanced green fluorescence protein (eGFP) gene and used to infect neuroglia cells including the U87 and U251 cell lines, under microgravity conditions; it was then detected by fluorescence microscopy and flow cytometry. The results show that microgravity affects the adhesion ability of cells, promotes transgene expression induced by rAAV2 and causes changes of viral infection receptors at different time points. These findings broaden the current understanding of the microgravity effects on rAAV, with significant implications in gene therapy and the mechanisms of increased virus pathogenicity under space microgravity.

Journal of Molecular Biochemistry

Study of phase transition of Potts model with DANN

Author: Chen Shiyang
Chen Xiangna
Deng Weibing
Li Wei
Liu Feiyi
Papp Gabor
Shen Jianmin
Yang Chunbin
Publication venue
Publication date: 08/09/2022
Field of study

A transfer learning method, domain adversarial neural network (DANN), is introduced to study the phase transition of two-dimensional q-state Potts model. With the DANN, we only need to choose a few labeled configurations automatically as input data, then the critical points can be obtained after training the algorithm. By an additional iterative process, the critical points can be captured to comparable accuracy to Monte Carlo simulations as we demonstrate it for q = 3, 5, 7 and 10. The type of phase transition (first or second-order) is also determined at the same time. Meanwhile, for the second-order phase transition at q = 3, we can calculate the critical exponent

\nu

by data collapse. Furthermore, compared with the traditional supervised learning, the DANN is of higher accuracy with lower cost.Comment: 25 pages, 23 figure

arXiv.org e-Print Archive

Machine Learning of Pair-Contact Process with Diffusion

Author: Chen Shiyang
Deng Shengfeng
Li Wei
Liu Feiyi
Shen Jianmin
Xu Dian
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2022
Field of study

The pair-contact process with diffusion (PCPD), a generalized model of the ordinary pair-contact process (PCP) without diffusion, exhibits a continuous absorbing phase transition. Unlike the PCP, whose nature of phase transition is clearly classified into the directed percolation (DP) universality class, the model of PCPD has been controversially discussed since its infancy. To our best knowledge, there is so far no consensus on whether the phase transition of the PCPD falls into the unknown university classes or else conveys a new kind of non-equilibrium phase transition. In this paper, both unsupervised and supervised learning are employed to study the PCPD with scrutiny. Firstly, two unsupervised learning methods, principal component analysis (PCA) and autoencoder, are taken. Our results show that both methods can cluster the original configurations of the model and provide reasonable estimates of thresholds. Therefore, no matter whether the non-equilibrium lattice model is a random process of unitary (for instance the DP) or binary (for instance the PCP), or whether it contains the diffusion motion of particles, unsupervised leaning can capture the essential, hidden information. Beyond that, supervised learning is also applied to learning the PCPD at different diffusion rates. We proposed a more accurate numerical method to determine the spatial correlation exponent

\nu_{\perp}

, which, to a large degree, avoids the uncertainty of data collapses through naked eyes. Our extensive calculations reveal that

\nu_{\perp}

of PCPD depends continuously on the diffusion rate

D

, which supports the viewpoint that the PCPD may lead to a new type of absorbing phase transition.Comment: 15 pages, 11 figure

arXiv.org e-Print Archive

PubMed Central

Repository of the Academy's Library

A deep-learning-based approach for seismic surface-wave dispersion inversion (SfNet) with application to the Chinese mainlandKey points

Author: Feiyi Wang
Mengkui Li
Xiaodong Song
Publication venue: 'Elsevier BV'
Publication date: 01/04/2023
Field of study

Surface-wave tomography is an important and widely used method for imaging the crust and upper mantle velocity structure of the Earth. In this study, we proposed a deep learning (DL) method based on convolutional neural network (CNN), named SfNet, to derive the vS model from the Rayleigh wave phase and group velocity dispersion curves. Training a network model usually requires large amount of training datasets, which is labor-intensive and expensive to acquire. Here we relied on synthetics generated automatically from various spline-based vS models instead of directly using the existing vS models of an area to build the training dataset, which enhances the generalization of the DL method. In addition, we used a random sampling strategy of the dispersion periods in the training dataset, which alleviates the problem that the real data used must be sampled strictly according to the periods of training dataset. Tests using synthetic data demonstrate that the proposed method is much faster, and the results for the vS model are more accurate and robust than those of conventional methods. We applied our method to a dataset for the Chinese mainland and obtained a new reference velocity model of the Chinese continent (ChinaVs-DL1.0), which has smaller dispersion misfits than those from the traditional method. The high accuracy and efficiency of our DL approach makes it an important method for vS model inversions from large amounts of surface-wave dispersion data

Directory of Open Access Journals

Link prediction based on time-varied weight in co-authorship network

Author: Huang S
Li J
Tang Feiyi
Tang Y
Publication venue: IEEE
Publication date: 01/01/2014
Field of study

Crossref

Victoria University Eprints Repository

Complete chloroplast genome sequence of Salix sinopurpurea (Salicaceae)

Author: Enze Li
Feiyi Guo
Kangjia Liu
Yachao Wang
Zhenfeng Zhan
Zhixiang Zhang
Publication venue: Taylor & Francis Group
Publication date: 01/03/2021
Field of study

Salix sinopurpurea is a morphologically special shrubby willow characterizing opposite leaves. Here, we reported the complete chloroplast (cp) genome sequence of Salix sinopurpurea. The cp genome is 155,546 bp in length, including a large single-copy (LSC) region of 84,412 bp, a small single-copy (SSC) region of 16,216 bp, and a pair of inverted repeated regions of 27,459 bp. The cp genome of Salix sinopurpurea encodes 130 genes, including 85 protein-coding genes, 37 tRNA genes, and eight rRNA genes. Phylogenetic tree showed that Salix sinopurpurea is closely related to Salix psammophila and Salix suchowensis

Directory of Open Access Journals

Feasibility Research on Fish Pose Estimation Based on Rotating Box Object Detection

Author: Bin Lin
Chaoli Mou
Feiyi Li
Jiao Li
Kailin Jiang
Xinyao Gong
Xuliang Duan
Zhiqi Xu
Publication venue: 'MDPI AG'
Publication date: 19/11/2021
Field of study

A video-based method to quantify animal posture movement is a powerful way to analyze animal behavior. Both humans and fish can judge the physiological state through the skeleton framework. However, it is challenging for farmers to judge the breeding state in the complex underwater environment. Therefore, images can be transmitted by the underwater camera and monitored by a computer vision model. However, it lacks datasets in artificial intelligence and is unable to train deep neural networks. The main contributions of this paper include: (1) the world’s first fish posture database is established. 10 key points of each fish are manually marked. The fish flock images were taken in the experimental tank and 1000 single fish images were separated from the fish flock. (2) A two-stage attitude estimation model is used to detect fish key points. The evaluation of the algorithm performance indicates the precision of detection reaches 90.61%, F1-score reaches 90%, and Fps also reaches 23.26. We made a preliminary exploration on the pose estimation of fish and provided a feasible idea for fish pose estimation

Multidisciplinary Digital Publishing Institute

A Brainnetome Atlas Based Mild Cognitive Impairment Identification Using Hurst Exponent

Author: Bin Jing
Bo Li
Feiyi Cui
Hongwen Chen
Ru Guo
Tingting Wang
Zhuqing Long
Publication venue: 'Frontiers Media SA'
Publication date: 01/01/2018
Field of study

Mild cognitive impairment (MCI), which generally represents the transition state between normal aging and the early changes related to Alzheimer’s disease (AD), has drawn increasing attention from neuroscientists due that efficient AD treatments need early initiation ahead of irreversible brain tissue damage. Thus effective MCI identification methods are desperately needed, which may be of great importance for the clinical intervention of AD. In this article, the range scaled analysis, which could effectively detect the temporal complexity of a time series, was utilized to calculate the Hurst exponent (HE) of functional magnetic resonance imaging (fMRI) data at a voxel level from 64 MCI patients and 60 healthy controls (HCs). Then the average HE values of each region of interest (ROI) in brainnetome atlas were extracted and compared between MCI and HC. At last, the abnormal average HE values were adopted as the classification features for a proposed support vector machine (SVM) based identification algorithm, and the classification performance was estimated with leave-one-out cross-validation (LOOCV). Our results indicated 83.1% accuracy, 82.8% sensitivity and 83.3% specificity, and an area under curve of 0.88, suggesting that the HE index could serve as an effective feature for the MCI identification. Furthermore, the abnormal HE brain regions in MCI were predominately involved in left middle frontal gyrus, right hippocampus, bilateral parahippocampal gyrus, bilateral amygdala, left cingulate gyrus, left insular gyrus, left fusiform gyrus, left superior parietal gyrus, left orbital gyrus and left basal ganglia

Directory of Open Access Journals

Frontiers - Publisher Connector